Introduction to the Web Archiving and Digital Libraries 2015 Workshop Issue

نویسندگان

  • Edward A. Fox
  • Zhiwu Xie
  • Martin Klein
چکیده

Our understanding of the past will, to a large extent, depend on our success with Web archiving. WADL 2015 brought together international leaders from industry, government, and academia, who are tackling this important challenge. This special issue includes summaries of twelve presentations on 24 June 2015. It is hoped that these works will stimulate other digital library (DL) and related investigations and efforts that will help ensure that broader and better archiving takes place, that more tools (beyond the excellent ones mentioned in these papers, that generally are freely available) become available, and that wider support develops to expand Web archiving. 1. WORKSHOP LOGISTICS In conjunction with the 2015 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2015, held in Knoxville, TN, USA (http://www.jcdl2015.org), WADL 2015 was held at the end of that conference. On June 24, attendees and remote participants spent a day exploring Web archiving. There were five long presentations and seven short presentations; the latter connected with a poster and demo session. To round out the day, there was an introduction, statements by participants, breaks, and a closing discussion. The three authors of this work served as co-chairs. 2. OBJECTIVES In addition to producing our website (http://www.dlib.vt.edu/WADL2015/) and this archival publication, and planning for a related journal special issue (see call at http://fox.cs.vt.edu/DL/Web-Archiving-Focused-Ed-Foxfinal-2015.pdf), this workshop strove: • to continue to build the community of people integrating web archiving & DLs; • to help attendees learn about useful methods, systems, and software in this area; • to help chart future research and improved practice in this area; • to promote synergistic efforts including collaborative projects and proposals; and • to produce an archival publication (this issue) that will help advance technology and practice. 3. DESCRIPTION WADL 2015 explored the integration of Web archiving and digital libraries, over the complete life cycle: creation/authoring, uploading, publishing in the Web, crawling/collecting, compressing, formatting, storing, preserving, analyzing, indexing, supporting access, etc. In particular, submissions were solicited on topics of interest such as: Archiving (events) Classification Crawling (focused) Databases / collections Extraction & analysis Globalization, languages Metadata Network science Resource description Standards, protocols Tweet connections Big data Community building Curation, quality control Discovery Filling gaps Linking archives Mobile devices Preservation Social sciences Systems, tools 4. PAPERS FOLLOWING This special issue includes, in order, the following works: 1. Robert S. Comer and Andrea J. Copeland. Methods for Capture of Social Media Content for Preservation in Memory Organizations 2. Mohamed M. G. Farag and Edward A. Fox. Building and archiving event web collections: A focused crawler approach 3. Zhiwu Xie, Prashant Chandrasekar and Edward Fox. A UWS Case for 200-Style Memento Negotiations 4. Luis Meneses, Sampath Jayarathna, Richard Furuta and Frank Shipman. Grading Degradation in an Institutionally Managed Repository 5. Sawood Alam, Michael L. Nelson, Herbert Van de Sompel, Lyudmila L. Balakireva, Harihar Shankar and David S. H. Rosenthal. Profiling Web Archives For Efficient Memento Query Routing 6. Smiljana Antonijevic and Ellysa Cahoy. Connecting the IR with the User: Personal Archiving via Zotero 7. Gerhard Gossen, Elena Demidova and Thomas Risse. The iCrawl System for Focused and Integrated Web Archive Crawling 8. Ian Milligan. Finding Community in the Ruins of GeoCities: Distantly Reading a Web Archive 9. Zhiwu Xie, Herbert Van de Sompel, Jinyang Liu, Johann van Reenen and Ramiro Jordan. Web Archiving Inconsistency: A Research Agenda 10. Tomas Foltyn and Martin Lhotak. The Czech Digital Library Fedora Commons based solution for aggregation, reuse, dissemination and archiving of digital documents 11. Tarek Kanan, Sagnik Ray Chowdhury, C. Lee Giles, Prashant Chandrasekar, and Edward A. Fox. Digital Library and Archiving for Qatar 12. Todd Suomela. Analytics for Monitoring Usage and Users of Archive-It Collections 5. RELATED EVENTS There have been a number of closely related events preceding this workshop, such as: • Working with Internet Archives for Research (WIRE 2014) NSF workshop, 17-18 June 2014, Cambridge, MA – see http://wp.comminfo.rutgers.edu/nsfia/ • Web Archiving and Digital Libraries (WADL’13), 25-26 July, at JCDL 2013, see http://www.ctrnet.net/sites/default/files/JCDL2013Workshop WebArchiving20130603.pdf and report in SIGIR Forum http://sigir.org/files/forum/2013D/p128.pdf • Web Archive Globalization Workshop, WAG 2011 – see http://cs.harding.edu/wag2011/, with 4 organizers plus 5 presenters and about 20 participants, held in Ottawa after JCDL 2011 (June 16-17) • Ongoing work by attendees in this area, growth in collaborative activity involving the Internet Archive, and specific community building successes like the Web Archive Cooperative – see http://infolab.stanford.edu/wac/ • Annual meetings of the International Internet Preservation Consortium (IIPC), partner meetings of the Internet Archive (Archive-It), and ten workshops held with ECDL/TPDL: International Web Archiving Workshop (IWAW), 2001-2010 6. ACKNOWLEDGEMENTS For their help with planning and reviewing, we thank the organizing committee: • Jefferson Bailey, Internet Archive, [email protected] • Prashant Chandrasekar, Virginia Tech, [email protected] • Mohamed Magdy Farig, Virginia Tech, [email protected] • Vinay Goel, Internet Archive, [email protected] • Frank McCown, Harding University, [email protected] • Michael Nelson, Old Dominion Univ., [email protected] • Andreas Rauber, TU Vienna, [email protected] • Matthew Weber, Rutgers, [email protected] This special issue arose in part because of work at numerous institutions supported by multiple sponsors. The first author of this paper was supported by the US National Science Foundation under Grant No. IIS-1319578 and by NPRP grant # 4-029-1-007 from the Qatar National Research Fund (a member of Qatar Foundation). The first and the second authors were also supported by the Incentive Awards of the Mellon Web Archiving Grant. The statements made herein are solely the responsibility of the authors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Web Archiving and Digital Libraries (WADL) 2016: Highlights and Introduction to this Special Issue

This workshop, reported in the following 12 papers, explored the integration of Web archiving and digital libraries, so the complete life cycle involved was introduced: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, exploration (searching, browsing), archiving (of events), etc. It included particular coverage of current topics of interest, e.g., big dat...

متن کامل

The Role of Digital Reality Technologies in Libraries: A Systemic Review

Introduction: Fourth-generation libraries can no longer be satisfied with web software facilities and must use technologies, including digital realities to increase the level of service and attract clients. Therefore, this review aimed to identify the roles and effects of these technologies in libraries. Methods: In this systematic review, PubMed, Web of Science, Scopus, and Google Scholar adv...

متن کامل

WADL 2016 Panels: Worldwide activities on Web archiving; Social media, Web archiving, and digital libraries

In addition to presentations based around scholarly papers, the Web Archiving and Digital Libraries 2016 workshop also featured two panel discussion sessions. Each panel centered on a theme and featured short presentations by the panelists, followed by a moderated discussion and interaction with workshop participants; the panels are described herein. 1. PANEL 1: WORLDWIDE ACTIVITIES ON WEB ARCH...

متن کامل

New Challenges for the Archiving of Digital Writing

In his article "New Challenges for the Archiving of Digital Writing" Heiko Zimmermann discusses the challenges of the preservation of digital texts. In addition to the problems already at the focus of attention of digital archivists, there are elements in digital literature which need to be taken into consideration when trying to archive them. Zimmermann analyses two works of digital literature...

متن کامل

Data Mining Web Archives

Many institutions are now building rich, significant archives of web content. Though the number of web archiving programs has grown, access models for these collections have remained focused on URL-based discovery and traditional live-web-style browsing. Given the resources required to build and maintain web archives, finding new forms of access for these collection will help increase use and t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • TCDL Bulletin

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2015